Corpus: fin_wikipedia_2007_1M

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 102868 k-
2 71322 p-
3 68636 s-
4 63427 t-
5 53537 v-
Top Character Bigrams
word rank frequency n-gram
1 27466 ka-
2 24220 va-
3 20600 ko-
4 17562 ma-
5 16193 ta-
Top Character Trigrams
word rank frequency n-gram
1 7573 val-
2 6099 maa-
3 5496 kan-
4 5489 per-
5 4893 pää-
Top Character 4-Grams
word rank frequency n-gram
1 2937 kesk-
2 2790 kans-
3 2646 peru-
4 2444 puol-
5 2112 kirj-
Top Character 5-Grams
word rank frequency n-gram
1 2391 kansa-
2 2378 perus-
3 1735 toimi-
4 1497 keski-
5 1474 Quell-
13326 msec needed at 2017-12-14 05:41